Quantization of Speech Features: Source Coding

نویسندگان

  • Stephen So
  • Kuldip K. Paliwal
چکیده

In this chapter, we describe various schemes for quantizing speech features to be used in distributed speech recognition (DSR) systems. We have analyzed the statistical properties of MFCCs that are most relevant to quantization, namely the correlation and probability density function shape, in order to determine the type of quantization scheme that would be most suitable for quantizing them efficiently. We also determine empirically the relationship between mean squared error and recognition accuracy in order to verify that quantization schemes, which minimize mean squared error, are also guaranteed to improve the recognition performance. Furthermore, we highlight the importance of noise robustness in DSR and describe the use of a perceptually weighted distance measure to enhance spectral peaks in vector quantization. Finally, we present some experimental results on the quantization schemes in a DSR framework and compare their relative recognition performances.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Vector-Predictive Multi-Mode Matrix Quantization Approach for Parametric Speech Coding

In parametric speech coding, the accuracy of parameter quantization has a significant effect on speech quality. In this paper, we present a flexible and high-fidelity multi-mode quantization approach that combines the beneficial features of predictive vector quantization and matrix quantization. As an example, the proposed technique is employed in quantization of the power component in a wavefo...

متن کامل

Source and channel coding for remote speech recognition over error-prone channels

This paper presents source and channel coding techniques for remote automatic speech recognition (ASR) systems. As a case study, Line Spectral Pairs (LSP) extracted from the 6th order allpole Perceptual Linear Prediction (PLP) spectrum are transmitted and speech recognition features are then obtained. The LSPs, quantized using first-order predictive vector quantization (VQ) at 300 bps, provide ...

متن کامل

Joint source-channel coding of LSP parameters for bursty channels

This work present a joint source-channel technique based on Channel Optimized Vector Quantization (COVQ) for transmission over bursty channels applied to LSP parameters coding. The bursty channel is modeled as a Finite State Channel (FSC) with two states. We call Bursty COVQ (BCOVQ) to the resulting quantization technique. The case in which channel state information is only available at the rec...

متن کامل

Improving the Error Resilience of G.711.1 Speech Coder with Multiple Description Coding

This thesis devises quantization and source-channel coding schemes to increase the error robustness of the newly standardized ITU-T G.711.1 speech coder. The schemes employ Gaussian mixture model (GMM) based multiple description quantizers (MDQ). The thesis reviews the literature focusing on GMM based quantization, MDQ, and GMM-MDQ design methods and bit allocation schemes. GMM-MDQ are then des...

متن کامل

A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition

We propose a novel packetization and variable bitrate compression scheme for DSR source coding, based on the Group of Pictures concept from video coding. The proposed algorithm simultaneously packetizes and further compresses source coded features using the high interframe correlation of speech, and is compatible with a variety of VQ-based DSR source coders. The algorithm approximates vector qu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014